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Abstract. Quantum ESPRESSO is an integrated suite of computer codes for electronic- 
structure calculations and materials modeling, based on density-functional theory, plane 
waves, and pseudopotentials (norm-conserving, ultrasoft, and projector-augmented wave). 
Quantum ESPRESSO stands for opEn Source Package for Research in Electronic Structure, 
Simulation, and Optimization. It is freely available to researchers around the world under 
the terms of the GNU General Public License. Quantum ESPRESSO builds upon newly- 
restructured electronic-structure codes that have been developed and tested by some of the 
original authors of novel electronic-structure algorithms and applied in the last twenty years 
by some of the leading materials modehng groups worldwide. Innovation and efficiency are 
still its main focus, with special attention paid to massively-parallel architectures, and a great 
effort being devoted to user friendliness. QUANTUM ESPRESSO is evolving towards a 
distribution of independent and inter-operable codes in the spirit of an open-source project, 
where researchers active in the field of electronic-structure calculations are encouraged to 
participate in the project by contributing their own codes or by implementing their own ideas 
into existing codes. 
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1. Introduction 

The combination of methodological and algorithmic innovations and ever-increasing 
computer power is delivering a simulation revolution in materials modeling, starting from 
the nanoscale up to bulk materials and devices [HI. Electronic- structure simulations based 
on density-functional theory (DFT) [|2l S lU have been instrumental to this revolution, and 
their application has now spread outside a restricted core of researchers in condensed-matter 
theory and quantum chemistry, involving a vast community of end users with very diverse 
scientific backgrounds and research interests. Sustaining this revolution and extending its 
beneficial effects to the many fields of science and technology that can capitalize on it 
represents a multifold challenge. In our view it is also a most urgent, fascinating and fruitful 
endeavor, able to deliver new forms for scientific exploration and discovery, where a very 
complex infrastructure — made of software rather than hardware — can be made available to 
any researcher, and whose capabilities continue to increase thanks to the methodological 
innovations and computing power scalability alluded to above. 

Over the past few decades, innovation in materials simulation and modeling has resulted 
from the concerted efforts of many individuals and groups worldwide, often of small size. 
Their success has been made possible by a combination of competences, ranging from the 
ability to address meaningful and challenging problems, to a rigorous insight into theoretical 
methods, ending with a marked sensibility to matters of numerical accuracy and algorithmic 
efficiency. The readiness to implement new algorithms that utilize novel ideas requires 
total control over the software being used-for this reason, the physics community has long 
relied on in-house computer codes to develop and implement new ideas and algorithms. 
Transitioning these development codes to production tools is nevertheless essential, both 
to extensively validate new methods and to speed up their acceptance by the scientific 
community. At the same time, the dissemination of codes has to be substantial, to justify 
the learning efforts of PhD students and young postdocs who would soon be confronted 
with the necessity of deploying their competences in different research groups. In order to 
sustain innovation in numerical simulation, we believe there should be little, if any, distinction 
between development and production codes; computer codes should be easy to maintain, to 
understand by different generations of young researchers, to modify, and extend; they should 
be easy to use by the layman, as well as general and flexible enough to be enticing for 
a vast and diverse community of end users. One easily understands that such conflicting 
requirements can only be tempered, if anything, within organized and modular software 
projects. 

Software modularity also comes as a necessity when complex problems in complex 
materials need to be tackled with an array of different methods and techniques. Multiscale 
approaches, in particular, strive to combine methods with different accuracy and scope to 
describe different parts of a complex system, or phenomena occurring at different time and/or 
length scales. Such approaches will require software packages that can perform different 
kinds of computations on different aspects of the same problem and/or different portions of 
the same system, and that allow for interoperabihty or joint usage of the different modules. 
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Different packages should at the very least share the same input/output data formats; ideally 
they should also share a number of mathematical and application libraries, as well as the 
internal representation of some of the data structures on which they operate. Individual 
researchers or research groups find it increasingly difficult to meet all these requirements 
and to continue to develop and maintain in-house software project of increasing complexity. 
Thus, different and possibly collaborative solutions should be sought. 

A successful example comes from the software for simulations in quantum chemistry, 
that has often (but not always) evolved towards commercialization: the development and 
maintenance of most well-known packages is devolved to non-profit ||5l[6l|7l[8l or commercial 
191 [ISl [m im companies. The software is released for a fee under some proprietary license 
that may impose several restrictions to the availability of sources (computer code in a high- 
level language) and to what can be done with the software. This model has worked well, and 
is also used by some of the leading development groups in the condensed-matter electronic- 
structure community lfT3l[T4ll . while some proprietary projects allow for some free academic 
usage of their products [[151 QSl [l2l [SI [13 [Ml. A commercial endeavor also brings the 
distinctive advantage of a professional approach to software development, maintenance, 
documentation, and support. 

We believe however that a more interesting and fruitful alternative can be pursued, and 
one that is closer to the spirit of science and scientific endeavor, modeled on the experience of 
the open-source software community. Under this model, a large community of users has full 
access to the source code and the development material, under the coordination of a smaller 
group of core developers. In the long term, and in the absence of entrenched monopolies, 
this strategy could be more effective in providing good software solutions and in nurturing 
a community engaged in providing those solutions, as compared to the proprietary software 
strategy. In the case of software for scientific usage, such an approach has the additional, and 
by no means minor, advantage to be in line with the tradition and best practice of science, that 
require reproducibility of other people's results, and where collaboration is no less important 
than competition. 

In this paper we will shortly describe our answer to the above-mentioned problems, 
as embodied in our Quantum ESPRESSO project (indeed, ESPRESSO stands for opEn 
Source Package for Research in Electronic Structure, Simulation, and Optimization). First, 
in Sec. 2, we describe the guiding lines of our effort. In Sec. 3, we give an overview of 
the current capabilities of Quantum ESPRESSO. In Sec. 4, we provide a short description 
of each software component presently distributed within Quantum ESPRESSO. In Sec. 5 
we give an overview of the parallelization strategies followed and implemented in QUANTUM 
ESPRESSO. Finally, Sec. 6 describes current developments and offers a perspective outlook. 
The Appendix sections discuss some of the more specific technical details of the algorithms 
used, that have not been documented elsewhere. 
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2. The Quantum ESPRESSO project 

Quantum ESPRESSO is an integrated suite of computer codes for electronic-structure 
calculations and materials modeling based on density-functional theory, plane waves basis 
sets and pseudopotentials to represent electron-ion interactions. QUANTUM ESPRESSO is 
free, open-source software distributed under the terms of the GNU General Public License 
(GPL) W^. 

The two main goals of this project are to foster methodological innovation in the field 
of electronic- structure simulations and to provide a wide and diverse community of end 
users with highly efficient, robust, and user-friendly software implementing the most recent 
innovations in this field. Other open-source projects [jlTl l22l l23l l24l 1251 exist, besides 
Quantum ESPRESSO, that address electronic -structure calculations and various materials 
simulation techniques based on them. Unlike some of these projects. Quantum ESPRESSO 
does not aim at providing a single monolithic code able to perform several different tasks by 
specifying different input data to a same executable. Our general philosophy is rather that of 
an open distribution, i.e. an integrated suite of codes designed to be interoperable, much in the 
spirit of a Linux distribution, and thus built around a number of core components designed and 
maintained by a small group of core developers, plus a number of auxiliary/complementary 
codes designed, implemented, and maintained by members of a wider community of users. 
The distribution can even be redundant, with different applications addressing the same 
problem in different ways; at the end, the sole requirements that Quantum ESPRESSO 
components must fulfill are that: i) they are distributed under the same GPL license agreement 
[|20| as the other QUANTUM ESPRESSO components; zz) they are fully interoperable with 
the other components. Of course, they need to be scientifically sound, verified and validated. 
External contributors are encouraged to join the Quantum ESPRESSO project, if they 
wish, while maintaining their own individual distribution and advertisement mode for their 
software (for instance, by maintaining individual web sites with their own brand names ll26ll ). 
To facilitate this, a web service called qe-f orge [(27l . described in the next subsection, has 
been recently put in place. 

Interoperability of different components within Quantum ESPRESSO is granted by 
the use of common formats for the input, output, and work files. In addition, external 
contributors are encouraged, but not by any means forced, to use the many numerical 
and application libraries on which the core components are built. Of course, this general 
philosophy must be seen more as an objective to which a very complex software project tends, 
rather than a starting point. 

One of the main concerns that motivated the birth of the Quantum ESPRESSO project 
is high performance, both in serial and in parallel execution. High serial performance across 
different architectures is achieved by the systematic use of standardized mathematical libraries 
(BLAS, LAPACK [jlBl, and FFTW [29]) for which highly optimized implementations exist 
on many platforms; when proprietary optimizations of these libraries are not available, the 
user can compile the library sources distributed with Quantum ESPRESSO. Optimal 
performance in parallel execution is achieved through the design of several parallelization 
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levels, using sophisticated communication algorithms, whose implementation often does not 
need to concern the developer, being embedded and concealed in appropriate software layers. 
As a result the performance of the key engines, PWscf (Sec. 4.1 ) and CP (Sec. 4.2) may 
scale on massively parallel computers up to thousands of processors. 

The distribution is organized into a basic set of modules, libraries, installation utilities, 
plus a number of directories, each containing one or more executables, performing specific 
tasks. The communications between the different executables take place via data files. We 
think that this kind of approach lowers the learning barrier for those who wish to contribute 
to the project. The codes distributed with Quantum ESPRESSO, including many auxiliary 
codes for the post-processing of the data generated by the simulations, are easy to install 
and to use. The GNU configure and make utilities ensure a straightforward installation 
on many different machines. Applications are run through text input files based on Fortran 
namelists, that require the users to specify only an essential but small subset of the many 
control variables available; a specialized graphical user interface (GUI) that is provided with 
the distribution facilitates this task for most component programs. It is foreseen that in the 
near future the design of special APIs (Application Programming Interfaces) will make it 
easier to glue different components of the distribution together and with external applications, 
as well as to interface them to other, custom-tailored, GUIs and/or command interpreters. 

The Quantum ESPRESSO distribution is written, mostly, in Fortran-95, with some 
parts in C or in Fortran-77. Fortran-95 offers the possibility to introduce advanced 
programming techniques without sacrificing the performances. Moreover Fortran is still the 
language of choice for high-performance computing and it allows for easy integration of 
legacy codes written in this language. A single source tree is used for all architectures, with C 
preprocessor options selecting a small subset of architecture-dependent code. Parallelization 
is achieved using the Message-Passing paradigm and calls to standard MPI (Message Passing 
Interface) OOll libraries. Most calls are hidden in a few routines that act as an intermediate 
layer, accomplishing e.g. the tasks of summing a distributed quantity over processors, of 
collecting distributed arrays or distributing them across processors, and to perform parallel 
three-dimensional Fast Fourier Transforms (EFT). This allows to develop straightforwardly 
and transparently new modules and functionalities that preserve the efficient parallelization 
backbone of the codes. 



2.1. QE-forge 

The ambition of the Quantum ESPRESSO project is not limited to providing highly 
efficient and user-friendly software for large-scale electronic- structure calculations and 
materials modeling. Quantum ESPRESSO aims at promoting active cooperation between 
a vast and diverse community of scientists developing new methods and algorithms in 
electronic-structure theory and of end users interested in their application to the numerical 
simulation of materials and devices. 

As mentioned, the main source of inspiration for the model we want to promote is the 
successful cooperative experience of the GNU/Linux developers' and users' community. One 
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of the main outcomes of this community has been the incorporation within the GNU/Linux 
operating system distributions of third-party software components, which, while being 
developed and maintained by autonomous, and often very small, groups of users, are put at 
the disposal of the entire community under the terms of the GPL. The community, in turn, 
provides positive feedback and extensive validation by benchmarking new developments, 
reporting bugs, and requesting new features. These developments have largely benefited 
from the SourceForge code repository and software development service OTll . or by other 
similar services, such as RubyForge, Tigris.org, BountySource, BerliOS, JavaForge, and GNU 
Savannah. 

Inspired by this model, the QUANTUM ESPRESSO developers' and users' community 
has set up its own web portal, named qe-forge [1271 . The goal of qe-forge is to 
complement the traditional web sites of individual scientific software projects, which are 
passive instruments of information retrieval, with a dynamical space for active content 
creation and sharing. Its aim is to foster and simplify the coordination and integration of the 
programming efforts of heterogeneous groups and to ease the dissemination of the software 
tools thus obtained. 

qe-forge provides, through a user-friendly web interface, an integrated development 
environment, whereby researchers can freely upload, manage and maintain their own 
software, while retaining full control over it, including the right of not releasing it. The 
services so far available include source-code management software (CVS or SVN repository), 
mailing lists, public forums, bug tracking facilities, up/down-load space, and wiki pages for 
projects' documentation, qe-forge is expected to be the main tool by which Quantum 
ESPRESSO end users and external contributors can maintain Quantum ESPRESSO- 
related projects and make them available to the community. 

2.2. Educational usage of Quantum ESPRESSO 

Training on advanced simulation techniques using the Quantum ESPRESSO distribution 
is regularly offered at SISSA to first-year graduate students within the electronic structure 
course. The scope of this course is not limited to the opportunities that modern simulation 
techniques based on electronic-structure theory offer to molecular and materials modeling. 
Emphasis is put onto the skills that are necessary to turn new ideas into new algorithms and 
onto the methods that are needed to validate the implementation and application of computer 
simulation methods. Based on this experience, the Quantum ESPRESSO developers' 
group offers on a regular basis training courses to graduate students and young researchers 
worldwide, also in collaboration with the Abdus Salam International Centre for Theoretical 
Physics, which operates under the aegis of the UNESCO and IAEA agencies of the UNO. 

The Quantum ESPRESSO distribution is used not only for graduate, but also 
for undergraduate training. At MIT, for example, it is one of the teaching tools in 
the class Introduction to Modeling and Simulations — an institute-wide course offered to 
undergraduates from the School of Science and the School of Engineering. The challenge 
here is to provide students of different backgrounds with an overview of numerical simulations 
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methods to study properties of real materials. For many undergraduates, this represents the 
first experience of computers used as scientific tools. To facihtate the access and use of 
Quantum ESPRESSO a user-friendly web interface has been developed at MIT, based on 
the GenePattem portal, that allows direct access to the code, thus removing the need to use 
a Unix/Linux environment or the details of the job queueing and submission procedure. The 
user utilizes a web browser (see Fig. [T]) to build input files and view the outputs of simulations, 
and to perform calculations from wherever Internet access is available. The calculations run 
on dedicated computer clusters where the code has been previously installed and tested. 

Using a web interface to easily access computational resources and share them among 
different users naturally points to the concept of cloud computing, and the previous model was 
tested at MIT in the Spring 2009, wholly based on a cluster of virtual machines on Amazon's 
Elastic Compute Cloud (EC2) web service. Our experience shows that when compared to the 
cost of purchasing, maintaining and administering computer clusters, the use of web-based 
computational resources becomes a very appealing and affordable option. It is particularly 
suited for classroom instruction, where advanced computational performance is not required, 
and it allows for easy transferability of this resource across universities. 
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Figure 1. Snapshot of the web interface used for undergraduate teaching at MIT. The software 
has been developed at the MIT's Office of Educational Innovation and Technology. 



3. Short description of Quantum ESPRESSO 

Quantum ESPRESSO implements a variety of methods and algorithms aimed at a 
chemically realistic modeling of materials from the nanoscale upwards, based on the solution 
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of the density-functional theory (DFT) [l2l[3]| problem, using a plane waves (PW) basis set and 
pseudopotentials (PP) [|32l to represent electron-ion interactions. 

The codes are constructed around the use of periodic boundary conditions, which allows 
for a straightforward treatment of infinite crystalline systems, and an efficient convergence to 
the thermodynamic limit for aperiodic but extended systems, such as liquids or amorphous 
materials. Finite systems are also treated using supercells; if required, open-boundary 
conditions can be used through the use of the density-countercharge method [[33l. QUANTUM 
ESPRESSO can thus be used for any crystal structure or supercell, and for metals as well as 
for insulators. The atomic cores can be described by separable [34J norm-conserving (NC) 
PPs [)35l|, ultra-soft (US) PPs 1361, or by projector-augmented wave (PAW) sets [37 1. Many 
different exchange-correlation functionals are available in the framework of the local-density 
(LDA) or generalized-gradient approximation (GGA) 08II . plus advanced functionals like 
Hubbard U corrections and few meta-GGA [39| and hybrid functionals [|40l SB 321. The 
latter is an area of very active development, and more details on the implementation of hybrid 
functionals and related Fock-exchange techniques are given in Appendix A.5. 

The basic computations/simulations that can be performed include: 

• calculation of the Kohn-Sham (KS) orbitals and energies [|43l for isolated or 
extended/periodic systems, and of their ground-state energies; 

• complete structural optimizations of the microscopic (atomic coordinates) and 
macroscopic (unit cell) degrees of freedom, using Hellmann-Feynman forces p4l l45l 
and stresses [|46l : 

• ground states for magnetic or spin-polarized system, including spin-orbit coupling P7l 
and non-coUinear magnetism P^l49ll : 

• ab initio molecular dynamics (MD), using either the Car-Parrinello Lagrangian [|50l or 
the Hellmann-Feynman forces calculated on the Bom-Oppenheimer (BO) surface [15T]| . 
in a variety of thermodynamical ensembles, including NPT variable-cell [|52il53l MD; 

• density-functional perturbation theory (DFPT) [1541 [55] [561 , to calculate second and 
third derivatives of the total energy at any arbitrary wavelength, providing phonon 
dispersions, electron-phonon and phonon-phonon interactions, and static response 
functions (dielectric tensors. Born effective charges, IR spectra, Raman tensors); 

• location of saddle points and transition states via transition-path optimization using the 
nudged elastic band (NEB) method [l57l [581 [591 ; 

• ballistic conductance within the Landauer-Buttiker theory using the scattering approach 

mt 

• generation of maximally localized Wannier functions [[6Tl[62l and related quantities; 

• calculation of nuclear magnetic resonance (NMR) and electronic paramagnetic 
resonance (EPR) parameters Il63l[64l; 

• calculation of K-edge X-ray absorption spectra J65l. 

Other more advanced or specialized capabilities are described in the next sections, while 
ongoing projects (e.g. time-dependent DFT and many-body perturbation theory) are 
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mentioned in the last section. Selected applications were described in Ref [|66l . Several 
utilities for data post-processing and interfacing to advanced graphic applications are 
available, allowing e.g. to calculate scanning tunneling microscopy (STM) images [67], the 
electron localization function (ELF) [1681I . Lowdin charges [|69J , the density of states (DOS), 
and planar [,70J or spherical averages of the charge and spin densities and potentials. 

3.1. Data file format 

The interoperability of different software components within a complex project such as 
Quantum ESPRESSO relies on the careful design of file formats for data exchange. A 
rational and open approach to data file formats is also essential for interfacing applications 
within Quantum ESPRESSO with third-party applications, and more generally to make 
the results of lengthy and expensive computer simulations accessible to, and reproducible 
by, the scientific community at large. The need for data file formats that make data exchange 
easier than it is now is starting to be widely appreciated in the electronic-structure community. 
This problem has many aspects and likely no simple, "one-size-fits-all", solution. Data files 
should ideally be 

• extensible: one should be able to add some more information to a file without breaking 
all codes that read that file; 

• self-documenting: it should be possible to understand the contents of a file without too 
much effort; 

• efficient: with data size in the order of GBytes for large-scale calculations, slow or 
wasteful I/O should be avoided. 

The current trend in the electronic-structure community seems to be the adoption of one of 
the following approaches: 

• Structured file formats, notably Hierarchical Data Format (HDF) [TTTII and network 
Common Data Form (netCDF) iTTlll . that have been widely used for years in other 
communities; 

• file formats based on the Extensible Markup Language (XML) [17311 . 

It is unlikely that a common, standardized data format will ever prevail in our community. We 
feel that we should focus, rather than on standardization, on an approach that allows an easy 
design and usage of simple and reliable converters among different data formats. Prompted 
by these considerations. Quantum ESPRESSO developers have opted for a simple solution 
that tries to combine the advantages of both the above-mentioned approaches. A single file 
containing all the data of a simulation is replaced by a data directory, containing several files 
and subdirectories, much in the same way as it is done in the Mac OS X operating system. The 
"head" file contains data written with ordinary Fortran formatted I/O, identified by XML tags. 
Only data of small size, such as atomic positions, parameters used in the calculation, one- 
electron and total energies, are written in the head file. Data of potentially large size, such as 
PW coefficients of KS orbitals, charge density, and potentials, are present as links to separate 
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files, written using unformatted Fortran 1/0. Data for each k-point are written to a separate 
subdirectory. A lightweight library called iotk, standing for Input/Output ToolKit [1741 . is 
used to read and write the data directory. 

Another problem affecting interoperability of PW-PP codes is the availability of data 
files containing atomic PP's — one of the basic ingredients of the calculation. There are many 
different types of PP's, many different codes generating PP's (see e.g. Ref 117511761 17711 ). each 
one with its own format. Again, the choice has fallen on a simple solution that makes it easy to 
write converters from and to the format used by QUANTUM ESPRESSO. Each atomic PP is 
contained in a formatted file (efficiency is not an issue here), described by a XML-like syntax. 
The resulting format has been named Unified Pseudopotential File (UPF). Several converters 
from other formats to the UPF format are available in Quantum ESPRESSO. 

4. Quantum ESPRESSO packages 

The complete Quantum ESPRESSO distribution is rather large. The current 4.1 version 
includes about 310,000 lines of Fortran-90 code, 1,000 lines of Fortran-77 code, 1,000 lines 
of C code, 2000 lines of Tel code, plus parts of external libraries such as FFTW, BLAS, 
LAPACK and the external toolkit iotk. In addition, there are approx. 10,000 lines of specific 
documentation (not counting different formats), more than 100 different examples and more 
than 100 tests of the different functionalities. Overall the complete distribution includes more 
than 3000 files, organized into 200 directories, and takes 22Mb in compressed format. 

With such a sizable code basis, modularization becomes necessary. Quantum 
ESPRESSO is presently divided into several executables, performing different types of 
calculations, although some of them have overlapping functionalities. Typically there is 
a single set of functions/subroutines or a single Fortran 90 module that performs each 
specific task (e.g. matrix diagonalizations, or potential updates), but there are still important 
exceptions to this rule, reflecting the different origin and different styles of the original 
components. Quantum ESPRESSO has in fact been built out of the merge and re- 
engineering of different packages, that were independently developed for several years. In 
the following, the main components are briefly described. 

4.1. PWscf 

PWscf implements an iterative approach to reach self-consistency, using at each step iterative 
diagonalization techniques, in the framework of the plane-wave pseudopotential method. An 
early version of PWscf is described in Ref [78J. 

Both separable NC-PPs and US-PPs are implemented; recently, also the projector 
augmented- wave method [l37ll has been added, largely following the lines of Ref [1791 for 
its implementation. In the case of US-PPs, the electronic wave functions can be made 
smoother at the price of having to augment their square modulus with additional contributions 
to recover the actual physical charge densities. For this reason, the charge density is more 
structured than the square of the wavefunctions, and requires a larger energy cutoff for its 
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plane wave expansion (typically, 6 to 12 times larger; for a NC-PP, a factor of 4 would be 
mathematically sufficient). Hence, different real-space Fourier grids are introduced - a "soft" 
one that represents the square of electronic wave functions, and a "hard" one that represents 
the charge density [[80118^1 J . The augmentation terms can be added either in reciprocal space 
(using an exact but expensive algorithm) or directly in real space (using an approximate but 
faster algorithm that exploits the local character of the augmentation charges). 

PWscf can use the well established LDA and GGA exchange-correlation functional, 
including spin-polarization within the scheme proposed in Ref [|82l and can treat non-coUinear 
magnetism [|48l |49l as e.g. induced by relativistic effects (spin-orbit interactions) [l83ll84]| or 
by complex magnetic interactions (e.g. in the presence of frustration). DFT -i- Hubbard U 
calculations fi85fl are implemented for a simplified ("no- J") rotationally invariant form [[86l 
of the Hubbard term. Other advanced functionals include TPSS meta-GGA [|391 , functional 
with finite-size corrections fWl\, and the PBEO [40] and B3LYP [4ll|42l hybrids. 

Self-consistency is achieved via the modified Broyden method of Ref [j88l, with some 
further refinements that are detailed in Appendix A.l. The sampling of the Brillouin Zone 
(BZ) can be performed using either special [|89l l90l k-points provided in input or those 
automatically calculated starting from a uniform grid. Crystal symmetries are automatically 
detected and exploited to reduce computational costs, by restricting the sampling of the BZ 
to the irreducible wedge alone (See Appendix A.4). When only the T point (k = 0) is used, 
advantage is taken of the real character of the KS orbitals, allowing one to store just half of 
the Fourier components. BZ integrations in metallic systems can be performed using a variety 
of smearing/broadening techniques, such as Fermi-Dirac, Gaussian, Methfessel-Paxton [| 9T1l . 
and Marzari-Vanderbilt cold smearing [|92l . The tetrahedron method [|93l is also implemented. 
Finite-temperature effects on the electronic properties can be easily accounted for by using 
the Fermi-Dirac smearing as a practical way of implementing the Mermin finite-temperature 
density-functional approach [|94ll . 

Structural optimizations are performed using the Broyden-Fletcher-Goldfarb-Shanno 
(BEGS) algorithm [|95l |96l |97l or damped dynamics; these can involve both the internal, 
microscopic degrees of freedom (i.e. the atomic coordinates) and/or the macroscopic ones 
(shape and size of the unit cell). The calculation of minimum-energy paths, activation 
energies, and transition states uses the Nudged Elastic Band (NEB) method |'57'|. Potential 
energy surfaces as a function of suitably chosen collective variables can be studied using 
Laio-Parrinello metadynamics [|98l . 

Microcanonical (NVE) MD is performed on the BO surface, i.e. achieving electron self- 
consistency at each time step, using the Verlet algorithm [|99l . Canonical (NVT) dynamics 
can be performed using velocity rescaling, or Anderson's or Berendsen's thermostats HIOOII . 
Constant-pressure (NPT) MD is performed by adding additional degrees of freedom for the 
cell size and volume, using either the Parrinello-Rahman Lagrangian filOlll or the so-called 
invariant Lagrangian of Wentzcovitch fi53il . 

The effects of finite macroscopic electric fields on the electronic structure of the ground 
state can be accounted for either through the method of Ref [11021 1 10311 based on the Berry 
phase, or (for slab geometries only) through a sawtooth external potential [|104[ I105II . A 
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quantum fragment can be embedded in a complex electrostatic environment that includes 
a model solvent II106II and a counterion distribution [I107II . as is typical of electrochemical 
systems. 

4.2. CP 

The CP code is the specialized module performing Car-Parrinello ab initio MD. CP can 
use both NC PPs [I108II and US PPs [11091 [801 . In the latter case, the electron density is 
augmented through a Fourier interpolation scheme in real space ("box grid") [fSOl [STl that 
is particularly efficient for large scale calculations. CP implements the same functionals as 
PWscf , with the exception of hybrid functionals; a simplified one-electron self-interaction 
correction (SIC)[TjO| is also available. The Car-Parrinello Lagrangian can be augmented 
with Hubbard U corrections HUlll . or Hubbard-based penalty functionals to impose arbitrary 
oxidation states [I112II . 

Since the main applications of CP are for large systems without translational symmetry 
(e.g. liquids, amorphous materials), Brillouin zone sampling is restricted to the V point of 
the supercell, allowing for real instead of complex wavefunctions. Metallic systems can be 
treated in the framework of "ensemble DFT" II113II . 

In the Car-Parrinello algorithm, microcanonical (NVE) MD is performed on both 
electronic and nuclear degrees of freedom, treated on the same footing, using the Verlet 
algorithm. The electronic equations of motion are accelerated through a preconditioning 
scheme 111141 . Constant-pressure (NPT) MD is performed using the Parrinello-Rahman 
Lagrangian HlOlll and additional degrees of freedom for the cell. Nose-Hoover thermostats 
[I115II and Nose-Hoover chains [I116II allow to perform simulations in the different canonical 
ensembles. 

CP can also be used to directly minimize the energy functional to self-consistency 
while keeping the nuclei fixed, or to perform structural minimizations of nuclear positions, 
using the "global minimization" approaches of Refs. [|117[ I118L and damped dynamics or 
conjugate-gradients on the electronic or ionic degrees of freedom. It can also perform NEB 
and metadynamics calculations. 

Finite homogeneous electric fields can be accounted for using the Berry phase method, 
adapted to systems with the V point only 111021 . This advanced feature can be used in 
combination with MD to obtain the infrared spectra of liquids 11102111191 . the low- and high- 
frequency dielectric constants [I1021I1201I and the coupling factors required for the calculation 
of vibrational properties, including infrared, Raman [|1211ll22lll23ll . and hyper- Raman [1124| 
spectra. 

4.3. PHonon 

The PHonon package implements density-functional perturbation theory (DFPT) [|54ll55ll56ll 
for the calculation of second- and third-order derivatives of the energy with respect to atomic 
displacements and to electric fields. The global minimization approach [11251 1126[[ is used 
for the special case of normal modes in finite (molecular) systems, where no BZ sampling is 
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required (Gamma code). In the general case a self-consistent procedure [l55ll is used, with 
the distinctive advantage that the response to a perturbation of any arbitrary wavelength 
can be calculated with a computational cost that is proportional to that of the unperturbed 
system. Thus, the response at any wavevector, including very small (long-wavelength) ones, 
can be inexpensively calculated. This latter approach, and the technicalities involved in the 
calculation of effective charges and interatomic force constants, are described in detail in Refs. 
11551112711 and implemented in the PH code. 

Symmetry is fully exploited in order to reduce the amount of computation. Lattice 
distortions transforming according to irreducible representations of small dimensions are 
generated first. The charge-density response to these lattice distortions is then sampled at 
a number of discrete k-points in the BZ, which is reduced according to the symmetry of the 
small group of the phonon wavevector q. The grid of the q points needed for the calculation 
of interatomic force constants reduces to one wavevector per star: the dynamical matrices at 
the other q vectors in the star are generated using the symmetry operations of the crystal. This 
approach allows us to speed up the calculation without the need to store too much data for 
symmetrization. 

The calculation of second-order derivatives of the energy works also for US PP [|128[|129l 
and for all GGA flavors [|130[|1311 used in PWscf and in CP. The extension of PHonon to 
PAW [I132L to noncoUinear magnetism and to fully relativistic US PPs which include spin- 
orbit coupling [I133II will be available by the time this paper will be printed. 

Advanced features of the PHonon package include the calculation of third order 
derivatives of the energy, such as electron-phonon or phonon-phonon interaction coefficients. 
Electron-phonon interactions are straightforwardly calculated from the response of the self- 
consistent potential to a lattice distortion. This involves a numerically- sensitive "double- 
delta" integration at the Fermi energy, that is performed using interpolations on a dense 
k-point grid. Interpolation techniques based on Wannier functions HI 3411 will speed up 
considerably these calculations. The calculation of the anharmonic force constants from 
third-order derivatives of the electronic ground- state energy is described in Ref. I1135II and 
is performed by a separate code called d3. Static Raman coefficients are calculated using 
the second-order response approach of Refs. [11361 113711 . Both third-order derivatives and 
Raman-coefficients calculations are currently implemented only for NC PPs. 

4.4. atomic 

The atomic code performs three different tasks: i) solution of the self-consistent all-electron 
radial KS equations (with a Coulomb nuclear potential and spherically symmetric charge 
density); ii) generation of NC PPs, of US PPs, or of PAW data-sets; Hi) test of the above PPs 
and data-sets. These three tasks can be either separately executed or performed in a single run. 
Three different all-electron equations are available: i) the non relativistic radial KS equations, 
ii) the scalar relativistic approximation to the radial Dirac equations [I138II . Hi) the radial Dirac- 
hke equations derived within relativistic density functional theory [113911 1401 . For i) and ii) 
atomic magnetism is dealt with within the local spin density approximation, i.e. assuming 
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an axis of magnetization. The atomic code uses the same exchange and correlation energy 
routines of PWscf and can deal with the same functionals. 

The code is able to generate NC PPs directly in separable form (also with multiple 
projectors per angular momentum channel) via the TrouUier-Martins [ 14111 or the Rappe- 
Rabe-Kaxiras-Joannopoulos |142] pseudization. US PPs can be generated by a two-step 
pseudization process, starting from a NC PPs, as described in Ref. [I143II . or using the solutions 
of the all-electron equation and pseudizing the augmentation functions [l80l . The latter method 
is used also for the PAW data-set generation. The generation of fully relativistic NC and 
US PPs including spin-orbit coupling effects is also available. Converters are available to 
translate pseudopotentials encoded in different formats (e.g. according to the Fritz-Haber 11751 
or Vanderbilt conventions) into the UPF format adopted by QUANTUM ESPRESSO. 

Transferability tests can be made simultaneously for several atomic configurations with 
or without spin-polarization, by solving the non relativistic radial KS equations generalized 
for separable nonlocal PPs and for the presence of an overlap matrix. 

4.5. PWcond 

The PWcond code implements the scattering approach proposed by Choi and Ihm [i60ll for 
the study of coherent electron transport in atomic- sized nanocontacts within the Landauer- 
Buttiker theory. Within this scheme the linear response ballistic conductance is proportional 
to the quantum- mechanical electron transmission at the Fermi energy for an open quantum 
system consisting of a scattering region (e.g., an atomic chain or a molecule with some 
portions of left and right leads) connected ideally from both sides to semi-infinite metallic 
leads. The transmission is evaluated by solving the KS equations with the boundary conditions 
that an electron coming from the left lead and propagating rightwards gets partially reflected 
and partially transmitted by the scattering region. The total transmission is obtained by 
summing all transmission probabilities for all the propagating channels in the left lead. As a 
byproduct of the method, the PWcond code provides the complex band structures of the leads, 
that is the Bloch states with complex in the direction of transport, describing wave functions 
exponentially growing or decaying in the z direction. The original method formulated with 
NC PPs has been generalized to US PPs both in the scalar relativistic H 14411 and in the fully 
relativistic forms [I145L 

4.6. GIPAW 

The GIPAW code allows for the calculation of physical parameters measured by i) nuclear 
magnetic resonance (NMR) in insulators (the electric field gradient (EFG) tensors and the 
chemical shift tensors), and by ii) electronic paramagnetic resonance (EPR) spectroscopy for 
paramagnetic defects in solids or in radicals (the hyperfine tensors and the g-tensor). The 
code also computes the magnetic susceptibility of nonmagnetic insulators. GIPAW is based 
on the PW-PP method, and uses many subroutines of PWscf and of PHonon. The code is 
currently restricted to NC PPs. All the NMR and EPR parameters depend on the detailed 
shape of the electronic wave-functions near the nuclei and thus require the reconstruction of 
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the all-electron wave-functions from the PP wave-functions. For the properties defined at 
zero external magnetic field, namely the EFG and the hyperfine tensors, such reconstruction 
is performed as a post-processing step of a self-consistent calculation using the PAW 
reconstruction, as described for the EFG in Ref. lfT46ll and for the hyperfine tensor in Ref. 
II147II . The g-tensor, the NMR chemical shifts and the magnetic susceptibility are obtained 
from the orbital linear response to an external uniform magnetic field. In the presence of a 
magnetic field the PAW method is no more gauge- and translationally invariant. Gauge and 
translational invariances are restored by using the gauge including projector augmented wave 
(GIPAW) method [|63l |64]| both i) to describe in the PP Hamiltonian the coupling of orbital 
degrees of freedom with the external magnetic field, and ii) to reconstruct the all-electron 
wave-functions, in presence of the external magnetic field. In addition, the description of 
a uniform magnetic field within periodic boundary conditions is achieved by considering 
the long wave-length limit of a sinusoidally modulated field in real space [|148[ 11491 . The 
NMR chemical shifts are computed following the method described in Ref. [l63l . the g-tensor 
following Ref. [1150 1 and the magnetic susceptibility following Refs. [|148[ [63l . Recently, 
a "converse" approach to calculate chemical shifts has also been introduced [I151II . based 
on recent developments on the Berry-phase theory of orbital magnetization; since it does not 
require a linear-response calculation, it can be straightforwardly applied to arbitrarily complex 
exchange-correlation functional, and to very large systems, albeit at a computational cost that 
is proportional to the number of chemical shifts that need to be calculated. 

4.7. XSPECTRA 

The XSPECTRA code allows for the calculation of K-edge X-ray absorption spectra (XAS). 
The code calculates the XAS cross-section including both dipolar and quadrupolar matrix 
elements. The code uses the self-consistent charge density produced by PWscf and acts as a 
post-processing tool. The all-electron wavefunction is reconstructed using the PAW method 
and its implementation in the GIPAW code. The presence of a core-hole in the final state of the 
X-ray absorption process is simulated by using a pseudopotential for the absorbing atom with 
a hole in the Is state. The calculation of the charge density is performed on a supercell with 
one absorbing atom. From the self-consistent charge density, the X-ray absorption spectra 
are obtained using the Lanczos method and a continued fraction expansion [l65l I152II . The 
advantage of this approach is that once the charge density is known it is not necessary to 
calculate empty bands to describe very high energy features of the spectrum. Correlation 
effects can be simulated in a mean-field way using the Hubbard U correction [86] that has 
been included in the XSPECTRA code in Ref. [I153L Currently the code is limited to coUinear 
magnetism. Its extension to noncoUinear magnetism is under development. 

4.8. Wannier90 

Wannier9 [[26l 1 15411 is a code that calculates maximally-localized Wannier functions in 
insulators or metals — according to the algorithms described in Refs. ll6Tl[62ll — and a number 
of properties that can be conveniently expressed in a Wannier basis. The code is developed 
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and maintained independendy by a Wannier development group HI 5411 and can be taken as 
a representative example of the philosophy described earlier, where a project maintains its 
own individual distribution but provides full interoperability with the core components of 
Quantum ESPRESSO in this case PWscf or CP. These codes are in fact used as "quantum 
engines" to produce the data onto which Wannier 90 operates. The need to provide 
transparent protocols for interoperability has in turn facilitated the interfacing of wannier 9 
with other quantum engines [|2Tl[T4ll . fostering a collaborative engagement with the broader 
electronic- structure community that is also in the spirit of QUANTUM ESPRESSO . 

Wannier 9 requires as input the scalar products between wavefunctions at neighboring 
k-points, where these latter form uniform meshes in the Brillouin zone. Often, it is also 
convenient to provide scalar products between wavefunctions and trial, localized real- space 
orbitals — these are used to guide the localization procedure towards a desired, physical 
minimum. As such, the code is not tied to a representation of the wavefunctions in any 
particular basis — for PWscf and CP a post-processing utility is in charge of calculating these 
scalar products using the plane-wave basis set of Quantum ESPRESSO and either NC-PPs 
or US-PPs. Whenever T sampling is used, the simplified algorithm of Ref. [I155II is adopted. 

Besides calculating maximally localized Wannier functions, the code is able to construct 
the Hamiltonian matrix in this localized basis, providing a chemically accurate, and 
transferable, tight-binding representation of the electronic structure of the system. This, 
in turn, can be used to construct Green's functions and self-energies for ballistic transport 
calculations [|156[ 11571 . to determine the electronic structure and DOS of very large scale 
structures [I157II . to interpolate accurately the electronic band structure (i.e. the Hamiltonian) 
across the Brillouin zone [|157[ 11581 . or to interpolate any other operator |158]. These latter 
capabilities are especially useful for the calculation of integrals that depend sensitively on a 
submanifold of states; common examples come from properties that depend sensitively on the 
Fermi surface, such as electronic conductivity, electron-phonon couplings Knight shifts, or the 
anomalous Hall effect. A related by-product of Wannier 9 is the capability of downfolding 
a selected, physically significant manifold of bands into a minimal but accurate basis, to be 
used for model Hamiltonians that can be treated with complex many-body approaches. 

4.9. PostProc 

The PostProc module contains a number of codes for post-processing and analysis of data 
files produced by PWscf and CP. The following operations can be performed: 

• Interfacing to graphical and molecular graphics applications. Charge and spin density, 
potentials, ELF lt68l and STM images lIFTIl are extracted or calculated and written to files 
that can be directly read by most common plotting programs, like xcrysden [I159II and 
VMD [fT60l . 

• Interfaces to other codes that use DFT resuks from QUANTUM ESPRESSO for 
further calculations, such as e.g.: pw2 wannier 9 0, an interface to the wannier 9 
library and code [|26lll54|| (also included in the Quantum ESPRESSO distribution); 
pw2casino . f 90, an interface to the casino quantum Monte Carlo code [I161II ; 
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wannier_ham . f 90, a tool to build a tight-binding representation of the KS 
Hamiltonian to be used by the dmft code HI 6211 (available at the qe-forge site); 
pw_export . f 90, an interface to the GW code SaX [I163II : pw2gw . f 90, an interface 
to code DP 1164] for dielectric property calculations, and to code EXC [I165II for excited- 
state properties. 

• Calculation of various quantities that are useful for the analysis of the results. In 
addition to the already mentioned ELF and STM, one can calculate projections over 
atomic states (e.g. Lowdin charges [|69l ), DOS and Projected DOS (PDOS), planar and 
spherical averages, and the complex macroscopic dielectric function in the random-phase 
approximation (RPA). 
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Figure 2. Snapshot of the PWgui application. Left: PWgui's main window; right: preview 
of specified input data in text mode. 



4.10. PWgui 

PWgui is the graphical user interface (GUI) for the PWscf , PHonon, and atomic packages 
as well as for some of the main codes in PostProc (e.g. pp . x and pro jwf c . x). PWgui 
is an input file builder whose main goal is to lower the learning barrier for the newcomer, 
who has to struggle with the input syntax. Its event-driven mechanism automatically adjusts 
the display of required input fields (i.e. enables certain sets of widgets and disables others) 
to the specific cases selected (see Fig. [2[ left panel). It enables a preview of the format of 
the (required) input file records for a given type of calculation (see Fig. |2} right panel). The 
input files created by PWgui are guaranteed to be syntactically correct (although they can 
still be physically meaningless). It is possible to upload previously generated input files for 
syntax checking and/or to modify them. It is also possible to run calculations from within 
the PWgui. In addition, PWgui can also use the external xcrysden program [11591 for the 
visualization of molecular and/or crystal structures from the specified input data and for the 
visualization of properties (e.g. charge densities or STM images). 
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Table 1. Summary of parallelization levels in QUANTUM ESPRESSO, 
distributed quantities communications performance 



image 



pool 



NEB images 



k-points 



very low 



low 



plane-wave plane waves, G-vector high 
coefficients, R-space 
EFT arrays 



linear CPU scahng, 
fair to good load balancing; 
does not distribute RAM 
almost linear CPU scaling, 
fair to good load balancing; 
does not distribute RAM 
good CPU scaling, 
good load balancing, 
distributes most RAM 



task EFT on electron states high 

linear algebra subspace Hamiltonians very high 
and constraints matrices 



improves load balancing 
improves scaling, 
distributes more RAM 



As the Quantum ESPRESSO codes evolve, the input file syntax expands as well. This 
implies that PWgui has to be continuously adapted. To effectively deal with such issue, 
PWgui uses the GO IB concept [il66fl . GO IB builds on the consideration that the input files 
for numerical simulation codes have a rather simple structure and it exploits this simplicity by 
defining a special meta-language with two purposes: the first is to define the input-file syntax, 
and the second is to simultaneously automate the construction of the GUI on the basis of such 
a definition. 

A similar strategy has been recently adopted for the description of the QUANTUM 
ESPRESSO input file formats. A single definition/description of a given input file serves i) as 
a documentation perse, ii) as a PWgui help documentation, and Hi) as a utility to synchronize 
the PWgui with up-to-date input file formats. 

5. Parallelization 

Keeping the pace with the evolution of high-end supercomputers is one of the guiding lines in 
the design of QUANTUM ESPRESSO, with a significant effort being dedicated to porting it 
to the latest available architectures. This effort is motivated not only by the need to stay at the 
forefront of architectural innovation for large to very-large scale materials science simulations, 
but also by the speed at which hardware features specifically designed for supercomputers find 
their way into commodity computers. 

The architecture of today's supercomputers is characterized by multiple levels and layers 
of inter-processor communication: the bottom layer is the one affecting the instruction set of a 
single core (simultaneous multithreading, hyperthreading); then one has parallel processing at 
processor level (many CPU cores inside a single processor sharing caches) and at node level 
(many processors sharing the same memory inside the node); at the top level, many nodes 
are finally interconnected with a high-performance network. The main components of the 
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Quantum ESPRESSO distribution are designed to exploit this highly structured hardware 
hierarchy. High performance on massively parallel architectures is achieved by distributing 
both data and computations in a hierarchical way across available processors, ending up with 
multiple parallelization levels [I167II that can be tuned to the specific application and to the 
specific architecture. This remarkable characteristic makes it possible for the main codes of 
the distribution to run in parallel on most or all parallel machines with very good performance 
in all cases. 

More in detail, the various parallelization levels are geared into a hierarchy of processor 
groups, identified by different MPI communicators. In this hierarchy, groups implementing 
coarser-grained parallel tasks are split into groups implementing finer-grained parallel tasks. 
The first level is image parallelization, implemented by dividing processors into Uimage 
groups, each taking care of one or more images (i.e. a point in the configuration space, 
used by the NEB method). The second level is pool parallelization, implemented by further 
dividing each group of processors into ripooi pools of processors, each taking care of one or 
more k-points. The third level is plane-wave parallelization, implemented by distributing 
real- and reciprocal- space grids across the npw processors of each pool. The final level is 
task group parallelization [I168II . in which processors are divided into ritask task groups of 
npFT = npw/ntask processors, each one taking care of different groups of electron states 
to be Fourier-transformed, while each EFT is parallelized inside a task group. A further 
paralellization level, linear-algebra, coexists side-to-side with plane- wave parallelization, i.e. 
they take care of different sets of operations, with different data distribution. Linear- algebra 
parallelization is implemented both with custom algorithms and using ScaLAPACK [,16911 . 
which on massively parallel machines yield much superior performances. Table [T] contains a 
summary of the five levels currently implemented. With the recent addition of the two last 
levels, most parallelization bottlenecks have been removed, while both computations and data 
structures are fully distributed. 

This being said, the size and nature of the specific application set quite natural limits to 
the maximum number of processors up to which the performances of the various codes are 
expected to scale. Eor instance, the number of k— points calculation sets a natural limit to 
the size of each pool, or the number of electronic bands sets a limit for the parallelization of 
the linear algebra operations. Moreover some numerical algorithms scale better than others. 
Eor example, the use of norm-conserving pseudopotentials allows for a better scaling than 
ultrasoft pseudopotentials for a same system, because a larger plane-wave basis set and a 
larger real- and reciprocal-space grids are required in the former case. On the other hand, 
using ultrasoft pseudopotentials is generally faster because the use of a smaller basis set is 
obviously more efficient, even though the overall parallel performance may not be as good. 

Simulations on systems containing several hundreds of atoms are by now quite standard 
(see Eig. |3] for an example). Scalability does not yet extend to tens of thousands of 
processors as in especially-crafted codes like QBox [I170II . but excellent scalability on up 
to 4800 processors has been demonstrated (see Eig. |4]) even for cases where coarse-grained 
parallelization does not help, using only MPI parallelization. We remarks that the results 
for CNT (2) in Eig. |4] were obtained with an earlier version of the CP code that didn't use 
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Figure 3. Scalability for medium-size calculations (CP code). CPU time (s) per electronic time 
step (left panel) and speedup with respect to 32 processors (right panel) as a function of the 
number of processors and for different numbers ntask of task groups, on a IBM BlueGene/P 
(BG/P) and on a SGI Altix. The system is a fragment of an A/?— peptide in water containing 
838 atoms and 2311 electrons in a 22.1 x 22.9 x 19.9 cell, ultrasoft pseudopotentials, T 
point, 25 Ry and 250 Ry cutoff for the orbitals and the charge density respectively. 



ScaLAPACK; the current version performs better in terms of scalability. 




Figure 4. Scalability for large-scale calculations: Wall time (left panel) and speedup (right 
panel) as a function of the number of processors. PSIWAT: PWscf code, Upooi = 4, 
n-task = 4, on a Cray XT 4. The system is a gold surface covered by thiols in interaction 
with water, 4 k-points, 10.59 x 20.53 x 32.66 A^ cell, 587 atoms, 2552 electi'ons. CNT (1): 
PWscf code, ntask = 4, on a Cray XT 4. The system is a porphyrin-functionalized nanotube, 
r point, 1532 atoms, 5232 electrons. CNT (2): CP code on a Cray XT3, same system as 
for CNT (1), Times for PSIWAT and CNT (1) are for 10 and 2 self-consistency iterations, 
respectively; times for CNT (2) are for 10 electronic steps plus 1 Car-Parrinello step, divided 
by 2 so that they fall in the same range as for CNT (1). 

The efforts of the QUANTUM ESPRESSO developers' team are not limited to the 
performance on massively parallel architectures. Special attention is also paid to optimize 
the performances for simulations of intermediate size (on systems comprising from several 
tens to a few hundreds inequivalent atoms), to be performed on medium-size clusters, readily 
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available to many groups [ISTIl . In particular, the QUANTUM ESPRESSO developers' 
team is now working to better exploit new hardware trends, particularly in the field of 
multicore architectures. The current version implements a partial but fully functional OpenMP 
parallelization [il71il . that is especially suitable for modern multicore CPU's. Mixing OpenMP 
with MPI also allows to extend scalability towards a higher number of processors, by adding 
a parallelization level on top of what can already be achieved using MPI. Preliminary tests on 
realistic physical systems demonstrate scalability up to 65536 cores, so far. 

Looking ahead, future developments will likely focus on hybrid systems with hardware 
accelerators (CPUs and cell co-processors). 

6. Perspectives and Outlook 

Further developments and extensions of Quantum ESPRESSO will be driven by the needs 
of the community using it and working on it. Many of the soon-to-come additions will deal 
with excited-state calculations within time-dependent DFT (TDDFT II1721I1731 ) and/or many- 
body perturbation theory [I174L A new approach to the calculation of optical spectra within 
TDDFT has been recently developed [I175II . based on a finite-frequency generalization of 
density-functional perturbation theory [[5411551, and implemented in Quantum ESPRESSO. 
Another important development presently under way is an efficient implementation of GW 
calculations for large systems (whose size is of the order of a few hundreds inequivalent 
atoms) [ 1761 . The implementation of efficient algorithms for calculating correlation energies 
at the RPA level is also presently under way II177[ 11781 1179II . It is foreseen that by the time 
this paper will appear, many of these developments will be publicly released. 

It is hoped that many new functionalities will be made available to QUANTUM 
ESPRESSO users by external groups who will make their own software compati- 
ble/interfaceable with Quantum ESPRESSO. At the time of the writing of the present pa- 
per, third-party scientific software compatible with Quantum ESPRESSO and available to 
its users' community include: yambo, a general-purpose code for excited-state calculations 
within many-body perturbation theory [I180II : casino, a code for electronic-structure quan- 
tum Monte Carlo simulations [161J; want, a code for the simulation of ballistic transport 
in nanostructures, based on Wannier functions P 181II : xcrysden, a molecular graphics ap- 
plication, especially suited for periodic structures II159II . The qe- forge portal is expected 
to burst the production and availability of third-party software compatible with QUANTUM 
ESPRESSO. Among the projects already available, or soon-to-be available, on qe-f orge, 
we mention: SaX [I163L an open-source project implementing state-of-the-art many-body 
perturbation theory methods for excited states; dmf t [I162II . a code to perform Dynamical 
Mean-Field Theory calculations on top of a tight-binding representation of the DFT band 
structure; qha, a set of codes for calculating thermal properties of materials within the quasi- 
harmonic approximation II182II ; pwtk, a fully functional Tel scripting interface to PWscf 

[HH. 

Efforts towards better interoperability with third-party software will be geared towards 
releasing accurate specifications for data structures and data file formats and providing 
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interfaces to and from other codes and packages used by the scientific community. Further 
work will be also devoted to the extension to the US-PPs and PAW schemes of the parts of 
Quantum ESPRESSO that are now limited to NC-PPs. 

The increasing availability of massively parallel machines will likely lead to an increased 
interest towards large-scale calculations. The ongoing effort in this field will continue. A 
special attention will be paid to the requirements imposed by the architecture of the new 
machines, in particular multicore CPUs, for which a mixed OpenMP-MPI approach seems 
to be the only viable solution yielding maximum performances. Grid computing and the 
commoditization of computer cluster will also lead to great improvements in high-throughput 
calculations for materials design and discovery. 

The new trend towards distributed computing is exemplified by the recent development of 
the Vlab cyber-infrastructure (CI) Ill84[|185ll . a service-oriented architecture (SOA) that uses 
Quantum ESPRESSO as the back-end computational package plus a web portal [|186L This 
SOA consists of scientific workflows for calculations of high-pressure (P) and temperature 
(T) properties of materials HI 871 . programmed as a collection of web services running in 
distributed environments, plus analysis tools to monitor workflow execution and visualization 
tools. High PT properties of a inexhaustible series of minerals is essential for the 
interpretation of seismic data and as input for geodynamic simulations. The VLab-Cl was 
developed to: 1) handle the job deluge created by the large number of points (10^-10^) in 
the parameter (pressures, strains, phonon q-points, composition) space sampled by these 
calculations, each point consisting of a first-principle task (PWscf or PHonon execution); 
2) handle the information flow between multi-leveled groups of tasks, with outputs from one 
level used to generate inputs for the next level; 3) harness the scalable aggregated throughput 
power of scattered computational resources. 
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Appendix 

This appendix contains the description of some algorithms used in QUANTUM ESPRESSO 
that have not been documented elsewhere. 
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Appendix A. 1. Self-consistency 

The problem of finding a self-consistent solution to the KS equations can be recast into the 
solution of a nonlinear problem 

X = F[x], X = (xi,X2, . . . ,XAr), (A.l) 

where vector x contains the Fourier components or real-space values of the charge density 
p or the KS potential V (the sum of Hartree and exchange-correlation potentials); F[x*^™^] is 
a functional of the input charge density or potential x*^*"), yielding the output vector x*^""*) via 
the solution of KS equations. A solution can be found via an iterative procedure. PWscf uses 
an algorithm based on the modified Broyden method ll88l in which x contains the components 
of the charge density in reciprocal space. Mixing algorithms typically find the optimal linear 
combination of a few x*^*") from previous iterations, that minimizes some suitably defined 
norm | jx*^""*^ — x*^*"^ 1 1 , vanishing at convergence, that we will call in the following "scf norm". 

Ideally, the scf norm is a measure of the self-consistency error on the total energy. Let 
us write an estimate of the latter for the simplest case: an insulator with NC PPs and simple 
LDA or GGA. At a given iteration we have 

f-;(^V2 + VUr) + V^'^^^r)] ijiir) = e^r), (A.2) 
y 2m J 

where and ipi are KS energies and orbitals respectively, i labels the occupied states, Vext 
is the sum of the PPs of atomic cores (written for simplicity as a local potential), the input 
Hartree and exchange-correlation potential ^(*")(r) = Vhxc[p^^"\''^)] is a functional of the 
input charge density p*^*"^ . The output charge density is given by 

^(o«*)(r) = ^|,^^(r)|2. (A.3) 

i 

Let us compare the DFT energy calculated in the standard way: 



E 



E / ^r(r) -^V^ + V,xt{r) U^)dv + Enxc[p'^"''% (A.4) 



where Ehxc is the Hartree and exchange-correlation energy, with the Harris-Weinert-Foulkes 
functional form, which doesn't use p(""*): 



E' = Y.j ^r(r) (^-^V^ + V,xt{v) + \/(^")(r)j ^.(r)dr 

-/ p^'^^^V^'^^Kv) + EhxAp^'""^] (A.5) 

Both forms are variational, i.e. the first-order variation of the energy with respect to the charge 
density vanish, and both converge to the same result when self-consistency is achieved. Their 
difference can be approximated by the following expression, in which only the dominant 
Hartree term is considered: 

2 J r-r' 



1 



Ap(r)Ayf^(r')cir (A.6) 
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where Ap = p(""*) — p(*") and AVh is the Hartree potential generated by Ap. Moreover it 
can be shown that, when exchange and correlation contributions to the electronic screening do 
not dominate over the electrostatic ones, this quantity is an upper bound to the self-consistent 
error incurred when using the standard form for the DFT energy. We therefore take this term, 
which can be trivially calculated in reciprocal space, as our squared scf norm: 

where G are the vectors in reciprocal space and Vl is the volume of the unit cell. 

Once the optimal linear combination of p*-*"^^ from previous iterations (typically 4 to 8) is 
determined, one adds a step in the new search direction that is, in the simplest case, a fraction 
of the optimal Ap or, taking advantage of some approximate electronic screening JTSHl, a 
preconditioned Ap. In particular, the simple, Thomas-Fermi, and local Thomas-Fermi mixing 
described in Ref. [118811 are implemented and used. 

The above algorithm has been extended to more sophisticated calculations, in which 
the X vector introduced above may contain additional quantities: for DFT-i-U, occupancies 
of atomic correlated states; for meta-GGA, kinetic energy density; for R\W, the quantities 
Y^iii^ilPn) iPml'ipi) ^ whcrc the P functions are the atomic-based projectors appearing in the 
PAW formalism. The scf norm is modified accordingly in such a way to include the additional 
variables in the estimated self-consistency error. 



Appendix A.2. Iterative diagonalization 

During self-consistency one has to solve the generalized eigenvalue problem for all 
occupied states 

H^, = eiSij„ t = l,...,N (A.8) 

in which both H (the Hamiltonian) and S (the overlap matrix) are available as operators (i.e. 
H^jJ and S^' products can be calculated for a generic state ^ ). Eigenvectors are normalized 
according to the generalized orthonormality constraints {'ipi\S\-ipj) = Sij. This problem is 
solved using iterative methods. Currently PWscf implements a block Davidson algorithm 
and an alternative algorithm based on band-by-band minimization using conjugate gradient. 



Appendix A.2.1. Davidson One starts from an initial set of orthonormalized trial orbitals 
ipf'^ and of trial eigenvalues ef* = {ipf'^\H\iljf^'^). The starting set is typically obtained from 
the previous scf iteration, if available, and if not, from the previous time step, or optimization 
step, or from a superposition of atomic orbitals. We introduce the residual vectors 

gr^ = iH-efS)^r\ (A.9) 

a measure of the error on the trial solution, and the correction vectors dtpf'^ = Dgf'\ where 
is a suitable approximation to {H — ef^ . The eigenvalue problem is then solved in the 
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2N 

Y,{H^u-e,S,u)cf = (A.IO) 

k=l 

where 

H,u = {ct>f\H\ct>f), ^,, = (0f (A.ll) 

Conventional algorithms for matrix diagonalization are used in this step. A new set of trial 
eigenvectors and eigenvalues is obtained: 

2N 

^f = T.^<tf. ^ = i^f\HVP?) (A.12) 
i=i 

and the procedure is iterated until a satisfactory convergence is achieved. Alternatively, one 
may enlarge the reduced basis set with the new correction vectors dip^^'' = Dgl^\ solve a 
3A^-dimensional problem, and so on, until a prefixed size of the reduced basis set is reached. 
The latter approach is typically slightly faster at the expenses of a larger memory usage. 

The operator D must be easy to estimate. A natural choice in the PW basis set is a 
diagonal matrix, obtained keeping only the diagonal term of the Hamiltonian: 

where k is the Bloch vector of the electronic states under consideration, |k + G') denotes 
PWs, e an estimate of the highest occupied eigenvalue. Since the Hamiltonian is a diagonally 
dominant operator and the kinetic energy of PWs is the dominant part at high G, this simple 
form is very effective. 



Appendix A.2. 2. Conjugate -Gradient The eigenvalue problem of Eq.( A.8 ) can be recast into 
a sequence of constrained minimization problems: 



mm 



(A. 14) 



where the Xj are Lagrange multipliers. This can be solved using a preconditioned conjugate 
gradient algorithm with minor modifications to ensure constraint enforcement. The algorithm 
here described was inspired by the conjugate-gradient algorithm of Ref. [1 189L and is similar 
to one of the variants described in Ref. [.190 J . 

Let us assume that eigenvectors ipj up to j = z — 1 have already been calculated. We 
start from an initial guess tp^^^ for the i-th eigenvector, such that {'tJj^^''\S\ip^^'') = 1 and 
{ip^'^''\S\ipj) = 0. We introduce a diagonal precondition matrix P and auxiliary functions 
y = P^^ij) and solve the equivalent problem 

mm[{y\H\y)-X[{y\S\y)-l)], (A.15) 

where H = PHP, S = PSP, under the additional orthonormality constraints {y\PS\ipj) = 
0. The starting gradient of Eq.( |A.15| )) is given by 

= (/?-A^)l/(°). (A.16) 
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By imposing that the gradient is orthonormal to the starting vector: {g^^^S\y''^^) = 0, one 
determines the value of the Lagrange multiplier: 

A = ^, I / . (A.17) 

(^(0)|52|^(0)) 

The remaining orthonormality constraints are imposed on Pg^'^^ by explicit orthonormaliza- 
tion (e.g. Gram-Schmid) to the ^j. We introduce the conjugate gradient h^^\ which for 
the first step is set equal to g^^^ (after orthonormalization), and the normalized direction 
n(o) = We search for the minimum of {y^^^\H\y^^^) along the direc- 

tion yW, defined as: dull 

y^"'^ cose + n^"hme. (A. 18) 



(1) 



y 

This form ensures that the constraint on the norm is correctly enforced. The calculation of the 
minimum can be analytically performed and yields 

where a(°) = 23ft(|/(°)|H'|n(°)), = and = The procedure 

is then iterated; at each step the conjugate gradient is calculated from the gradient and the 
conjugate gradient at the previous step, using the Polak-Ribiere formula: 

/iW = gin) ^^{n-l)}^{n-l)^ (^_20) 

(n-1) ^ \9 g ^ \^\g ) 21) 

{gin-l)\S\gin-l)) 

h^"-^ is subsequently re-orthogonalized to y^'^\ We remarks that in the practical 
implementation only Pg and Ph need to be calculated and that only P^ - the analogous 
of the D matrix of Davidson algorithm - is actually used. A kinetic-only form of P^ has 
proved satisfactory: 

2777 

(k + G\P'\k + G') = j^2^^^^y SGG'. (A.22) 
Appendix A. 3. Wavefunction extrapolation 

In molecular dynamics runs and in structural relaxations, extrapolations are employed to 
generate good initial guesses for the wavefunctions at time t + dt from wavefunctions at 
previous time steps. The extrapolation algorithms used are similar to those described in 
Ref. ni89ll . The alignment procedure, needed when wavefunctions are the results of a 
self-consistent calculation, is as follows. The overlap matrix Ojj between wavefunctions at 
consecutive time steps: 

Oi, = {Ait + dt)\S{t + rft)|^,-(t)), (A.23) 

can be used to generate the unitary transformation U 019111 that aligns + dt) to il){t): 
ipl [t + dt) = J2j Uijipj [t + dt) . Since O is not unitary, it needs to be made unitary via e.g. the 
unitarization procedure 

U = {0^0)-^^^0l (A.24) 
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The operation above is performed using a singular value decomposition: let the overlap matrix 
be O = vDw, where v and w are unitary matrix and D is a diagonal non-negative definite 
matrix, whose eigenvalues are close to 1 if the two sets of wavefunctions are very similar. The 
needed unitary transformation is then simply given by t/ ~ w'^v'^ . This procedure is simpler 
than the original proposal and prevents the alignment algorithm to break in the occasional 
situation where, due to level crossing in the band structure between subsequent time steps, 
one or more of the eigenvalues of the D matrix vanish. 



Appendix A.4. Symmetry 

Symmetry is exploited almost everywhere, with the notable exception of CP. The latter is 
devised to study aperiodic systems or large supercells where symmetry is either absent or of 
httle use even if present. 

In addition to lattice translations, the space group of a crystal contains symmetry 
operations S combining rotations and translations that leave the crystal unchanged: S = 
{i?|f}, where i? is a 3 x 3 orthogonal matrix, f is a vector (called fractional translation) 
and symmetry requires that any atomic position, is transformed into an equivalent one, 
Sts = R{ts + f) = T5(g). The rotational part of these operations defines the crystal point 
group. 

As a consequence of symmetry, roto-translated KS orbitals are KS orbitals with the 
rotated Bloch vector: 5'^i,k(r) = i>i,\i{R^^r — f) = 'ipi,Kk{r). Where, strictly speaking, the 
resulting wave-function at Kk does not necessarily have the same band index as the original 
one but could be some unitary transformation of states at Rk that share with it the same single- 
particle eigenvalue. Since quantities of physical interest are invariant for unitary rotations 
among degenerate states this additional complication has no effect on the final result. 

This is the basis for the symmetrization procedure used in PWscf . One introduces a 
non- symmetrized charge density (labeled by superscript calculated on the irreducible BZ 
(IBZ): 

^(n.)(r) = ^ E ^kl^.kWr (A.25) 

i kelBZ 

The factors Wk ("weights") are proportional to the number of vectors in the star (i.e. 
inequivalent k vectors among all the {-Rk} vectors generated by the point- group rotations) 
and are normalized to 1: SkeiBZ ^^^k = 1- Weights can either be calculated or deduced from 
the Uterature on the special-point technique[l89l|90l. The charge density is then symmetrized 
as: 

pW = ^E^P^^^^r) = ^EP^"^^(^"'r-f) (A.26) 

5 " S 

where the sum runs over all Ng symmetry operations. 

The symmetrization technique can be extended to all quantities that are expressed as 
sums over the BZ. Hellmann-Feynman forces on atom s are thus calculated as follows: 

F. = ^i:SF<-> = ^ (A.27) 

^ s s 
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where S^^{s) labels the atom into which the s— th atom transforms (modulo a lattice 
translation vector) after application of 5*^^, the symmetry operation inverse of S. In a similar 
way one determines the symmetrized stress, using the rule for matrix transformation under a 
rotation: 

= ]^ E E R-,Rps cj%'^. (A.28) 

The PHonon package supplements the above technique with a further strategy. Given the 
phonon wave-vector q, the small group of q (the subgroup of crystal symmetry operations 
that leave q invariant) is identified and the reducible representation defined by the 'iNat 
atomic displacements along cartesian axis is decomposed into ni„. irreducible representations 
(irreps) = 1, . . . ,nirr- The dimensions of the irreducible representations are small, 

with Uj < 3 in most cases, up to 6 in some special cases (zone -boundary wave-vectors q in 
nonsymmorphic groups). Each irrep, j, is therefore defined by a set of Uj linear combinations 
of atomic displacements that transform into each other under the symmetry operations of 
the small group of q. In the self-consistent solution of the linear response equations, only 
perturbations associated to a given irrep need to be treated together and different irreps can be 
solved independently. This feature is exploited to reduce the amount of memory required by 
the calculation and is suitable for coarse-grained parallelization and for execution on a Grid 
infrastructure [I192II . 

The wavefunction response, /\iIj^^^^^{v), to displacements along irrep j, 7^-^^ (where 
a = 1, . . . , z/j labels different partners of the given irrep), is then calculated. The lattice- 
periodic unsymmetrized charge response, Apq"J^^(r), has the form: 

ApS(r) = e-^''-4E ^ ^k^k,(r)A^^:fi(r), (A.29) 

i kGlBZ(q) 

where the notation IBZ(q) indicates the IBZ calculated assuming the small group of q as 
symmetry group, and the weights are calculated accordingly. The symmetrized charge 
response is calculated as 

APq.,«(r) = ^ E e^^^' E D{S,)pa ^GiR~'r - f ) (A.30) 

where D(Sq) is the matrix representation of the action of the symmetry operation = {-R|f } 
for the J — th irrep 'y'^'^K At the end of the self-consistent procedure, the force constant matrix 
Csa,t/3(q) (where s, t label atoms, a, (3 cartesian coordinates) is calculated. Force constants at 
all vectors in the star of q are then obtained using symmetry: 

Csa,tp{Rqi) = E-^«'5-R/37Cs'-i(s)<5,S-iW7(^)' (A.31) 
7,(5 

where S = {-R|f} is a symmetry operation of the crystal group but not of the small group of 

q- 
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Appendix A.5. Fock exchange 

Hybrid functionals are characterized by the inclusion of a fraction of exact {i.e. non-local) 
Fock exchange in the definition of the exchange-correlation functional. For a periodic system, 
the Fock exchange energy per unit cell is given by: 



where an insulating and non magnetic system is assumed for simplicity. Integrals and wave- 
function normalizations are defined over the whole crystal volume, V = NVl (fi being the unit 
cell volume), and the summations run over all occupied bands and all k-points defined in 
the BZ by Born-von Karman boundary conditions. The calculation of this term is performed 
exploiting the dual-space formalism: auxiliary codensities, pk',«'(r) = V^|^,^,(r)?/'k,t)(r) are 



computed in real space and transformed to reciprocal space by EFT, where the associated 
electrostatic energies are accumulated. The application of the Fock exchange operator to 
a wavefunction involves additional EFTs and real-space array multiplications. These basic 
operations need to be repeated for all the occupied bands and all the points in the BZ grid. 
For this reason the computational cost of the exact exchange calculation is very high, at least 
an order of magnitude larger than for non- hybrid functional calculations. 

In order to limit the computational cost, an auxiliary grid of q-points in the BZ, centered 
at the r point, can be introduced and the summation over k' be limited to the subset k' = k+q. 
Of course convergence with respect to this additional parameter needs to be checked, but often 
a grid coarser than the one used for computing densities and potentials is sufficient. 

The direct evaluation of the Fock energy on regular grids in the BZ is however 
problematic due to an integrable divergence that appears in the q — limit. This problem 
is addressed resorting to a procedure, first proposed by Gygi and Baldereschi [I193L where 
an integrable term that displays the same divergence is subtracted from the expression for 
the exchange energy and its analytic integral over the BZ is separately added back to it. 
Some care must still be paid [I177II in order to estimate the contribution of the q = term 
in the sum, which contains a 0/0 limit that cannot be calculated from information at q = 
only. This term is estimated [I177II assuming that the grid of q-points used for evaluating the 
exchange integrals is dense enough that a coarser grid, including only every second point in 
each direction, would also be equally accurate. Since the limiting term contributes to the 
integral with different weights in the two grids, one can extract its value from the condition 
that the two integral give the same result. This procedure removes an error proportional to 
the inverse of the unit cell volume ^7 that would otherwise appear if this term were simply 
neglected. 

References 

[1] N. Marzari. MRS BULLETIN, 31MI, 2006. 

[2] R. G. Parr and W. Yang. Density Functional Theory of Atoms and Molecules. Oxford University Press, 




(A.32) 



1989. 



Quantum ESPRESSO 



31 



[3] R. M. Dreizler and E. K. U. Gross. Density Functional Theory. Springer- Verlag, 1990. 
[4] R.M. Martin. Cambridge University Press, Cambridge, UK, 2004. 

[5] E. J. Baerends, J. Autschbach, A. Berces, F.M. Bickelhaupt, C. Bo, R M. Boerrigter, L. Cavallo, D. P. 
Chong, L. Deng, R. M. Dickson, D. E. Ellis, M. van Faassen, L. Fan, T. H. Fischer, C. Fonseca 
Guerra, S. J. A. van Gisbergen, A. W. Gotz, J. A. Groeneveld, O. V. Gritsenko, M. Griining, F. E. 
Harris, P. van den Hoek, C. R. Jacob, H. Jacobsen, L. Jensen, G. van Kessel, F. Kootstra, M. V. 
Krykunov, E. van Lenthe, D. A. McCormack, A. Michalak, J. Neugebauer, V. P. Nicu, V. P. Osinga, 
S. Patchkovskii, P H. T. Philipsen, D. Post, C. C. Pye, W. Ravenek, J. I. Rodriguez, P Ros, P R. T. 
Schipper, G. Schreckenbach, J. G. Snijders, M. Sola, M. Swart, D. Swerhone, G. te Velde, P. Vernooijs, 
L. Versluis, L. Visscher, O. Visser, F. Wang, T. A. Wesolowski, E. M. van Wezenbeek, G. Wiesenekker, 
S. K. Wolff, T. K. Woo, A. L. Yakovlev, and T. Ziegler. ADF2008.01, SCM, Theoretical Chemistry, 
Vrije Universiteit, Amsterdam, The Netherlands, http://www.scm.com. 

[6] Roberto Dovesi, Bartolomeo Civalleri, Roberto Orlando, Carla Roetti, and Victor R. Saunders, in Reviews 
in Computational Chemistry, Ch. 1, Vol. 21, K. B. Lipkowitz, R. Larter, T. R. Cundari editors, John 
Wiley and Sons, New York (2005). 

[7] G. Karlstrom, R. Lindh, P.-A. Malmqvist, B. O. Roos, U. Ryde, V. Veryazov, P.-O. Widmark, M. Cossi, 
B. Schimmelpfennig, P. Neogrady, and L. Seijo. Comp. Mater ScL, 28:222, 2003. 

[8] Reinhart Ahhichs, Filipp Furche, Christof Hattig, Willem M. Klopper, Marek Sierka, and Florian 
Weigend. TURBOMOLE - http://www.turbomole-gmbh.com. 

[9] M. J. Frisch, G. W. Trucks, H. B. Schlegel, G. E. Scuseria, M. A. Robb, J. R. Cheeseman, J. A. 
Montgomery, Jr., T. Vreven, K. N. Kudin, J. C. Burant, J. M. Millam, S. S. Iyengar, J. Tomasi, 
V. Barone, B. Mermucci, M. Cossi, G. Scalmani, N. Rega, G. A. Petersson, H. Nakatsuji, M. Hada, 
M. Ehara, K. Toyota, R. Fukuda, J. Hasegawa, M. Ishida, T. Nakajima, Y. Honda, O. Kitao, H. Nakai, 
M. Klene, X. Li, J. E. Knox, H. P. Hratchian, J. B. Cross, V. Bakken, C. Adamo, J. Jaramillo, 
R. Gomperts, R. E. Stratmann, O. Yazyev, A. J. Austin, R. Cammi, C. Pomelli, J. W. Ochterski, P. Y. 
Ayala, K. Morokuma, G. A. Voth, P. Salvador, J. J. Dannenberg, V. G. Zakrzewski, S. Dapprich, A. D. 
Daniels, M. C. Strain, O. Farkas, D. K. Mahck, A. D. Rabuck, K. Raghavachari, J. B. Foresman, J. V. 
Ortiz, Q. Cui, A. G. Baboul, S. Clifford, J. Cioslowski, B. B. Stefanov, G. Liu, A. Liashenko, P. Piskorz, 
L Komaromi, R. L. Martin, D. J. Fox, T. Keith, M. A. Al-Laham, C. Y. Peng, A. Nanayakkara, 
M. Challacombe, P M. W. Gill, B. Johnson, W. Chen, M. W. Wong, C. Gonzalez, and J. A. Pople. 
Gaussian 03, Revision C.02. Gaussian, Inc., Wallingford, CT, 2004, http://www.gaussian.com. 
[10] Materials Studio, http://accelrys.com/products/materials-studio. 

[11] Jaguar: rapid ab initio electronic structure package, Schrodinger, LLC, http://www.schrodinger.com/. 
[12] Spartan, Wavefunction, Inc., http://www.wavefun.com/products/spartan.html. 

[13] G. Kresse and J. Furthmuller. Comp. Mater ScL, 6(1): 15-50, JUL 1996. http://cmp.univie.ac.at/vasp. 

[14] http://www.castep.org, http://accelrys.com/products/materials-studio/modules/CASTEP.html. 

[15] The CPMD consortium page, coordinated by M. Parrinello and W. Andreoni, Copyright IBM Corp 1990- 
2008, Copyright MPl ftir Festkorperforschung Stuttgart 1997-2001, http://www.cpmd.org. 

[16] E. J. Bylaska, W. A. de Jong, N. Govind, K. Kowalski, T. P. Straatsma, M. Vahev, D. Wang, E. Apra, 
T. L. Windus, J. Hammond, P Nichols, S. Hirata, M. T. Hackler, Y. Zhao, P-D. Fan, R. J. Harrison, 
M. Dupuis, D. M. A. Smith, J. Nieplocha, V. Tipparaju, M. Krishnan, Q. Wu, T. Van Voorhis, 
A. A. Auer, M. Nooijen, E. Brown, G. Cisneros, G. I. Fann, H. Fruchtl, J. Garza, K. Hirao, 
R. Kendall, J. A. Nichols, K. Tsemekhman, K. Wolinski, J. Anchell, D. Bemholdt, P. Borowski, 
T. Clark, D. Clerc, H. Dachsel, M. Deegan, K. Dyall, D. Elwood, E. Glendening, M. Gutowski, 

A. Hess, J. Jaffe, B. Johnson, J. Ju, R. Kobayashi, R. Kutteh, Z. Lin, R. Littlefield, X. Long, 

B. Meng, T. Nakajima, S. Niu, L. Pollack, M. Rosing, G. Sandrone, M. Stave, H. Taylor, G. Thomas, 
J. van Lenthe, A. Wong, , and Z. Zhang. Nwchem, a computational chemistry package for parallel 
computers, version 5.1. Pacific Northwest National Laboratory, Richland, Washington 99352-0999, 
USA, http://www.emsl.pnl.gov/docs/nwchem/nwchem.html. 

[17] R. A. Kendall, E. Apra, D. E. Bemholdt, E. J. Bylaska, M. Dupuis, G. I. Fann, R. J. Harrison, J. Ju, 
J. A. Nichols, J. Nieplocha, T. P. Straatsma, T. L. Windus, and A. T. Wong. High performance 



Quantum ESPRESSO 



32 



computational chemistry: An overview of nwchem a distributed parallel application. Comput. Phys. 

Commun., 128:260-283, 2000. 
[18] M. W. Schmidt, K. K. Baldridge, J. A. Boatz, S. T. Elbert, M. S. Gordon, J. H. Jensen, S. Koseki, 

N. Matsunaga, K. A. Nguyen, S. J. Su, T. L. Windus, M. Dupuis, and J. A. Montgomery. General 

atomic and molecular electronic structure system. /. Comput. Chem., 14:1347-1363, 1993. 
[19] M. S. Gordon and M. W. Schmidt. Advances in electronic structure theory: Gamess a decade later. 

in Theory and Applications of Computational Chemistry, the first forty years, C. E. Dykstra, G. 

Frenking, K. S. Kim, G. E. Scuseria editors, ch. 41, pp 1167-1189, Elsevier, Amsterdam, 2005, 

http://www.msg.chem.iastate.edu/GAMESS/GAMESS.html. 
[20] http://www.gnu.org/licenses/. 

[21] X. Gonze, J.-M. Beuken, R. Caracas, F. Detraux, M. Fuchs, G.-M. Rignanese, L. Sindic, M. Verstraete, 
G. Zerah, F. Jollet, M. Torrent, A. Roy, M. Mikami, Ph. Ghosez, J.-Y. Raty, and D. C. Allan. First- 
principle computation of material properties: the abinit software project. Comp. Mater. Sci., 25:478, 
2002. http://www.abinit.org. 

[22] J. VandeVondele, M. Krack, F. Mohamed, M. Parrinello, T. Chassaing, and J. Hutter. Comput. Phys. 
Commun., 167:103,2005. http://cp2k.berlios.de. 

[23] S. R. Bahn and K. W. Jacobsen. An object-oriented scripting interface to a legacy electronic-structure 
code. Comput. Sci. Eng., 4:56, 2002. http://www.fysik.dtu.dk/campos. 

[24] J. J. Mortensen, L. B. Hansen, and K. W. Jacobsen. Phys. Rev. B, 71, 2005. https://wiki.fysik.dtu.dk/gpaw. 

[25] T. Daniel Crawford, C. David Sherrill, Edward F. Valeev, Justin T. Fermann, Rollin A. King, Matthew L. 
Leirunger, Shawn T. Brown, Curtis L. Janssen, Edward T. Seidl, Joseph P. Kenny, and Wesley D. AUen. 
J. Comput. Chem., 28(9):1610-1616, 2007. http://www.psicode.org. 

[26] A. A. Mostofi, J. R. Yates, Y.-S. Lee, I. Souza, D. Vanderbilt, and N. Marzari. Wannier90: A tool for 
obtaining maximally-localised wannier functions. Comput. Phys. Commun., 178:685, 2008. 

[27] http://qe-forge.org. 

[28] E. Anderson, Z. Bai, C. Bischof, S. Blackford, J. Demmel, J. Dongarra, J. Du Croz, A. Greenbaum, 

S. Hammarling, A. McKermey, and D. Sorensen. LAPACK Users' Guide. Society for Industrial and 

Applied Mathematics, Philadelphia, PA, third edition, 1999. 
[29] Matteo Frigo and Steven G. Johnson. The design and implementation of FFTW3. Proceedings of 

the IEEE, 93(2):216-231, 2005. special issue on "Program Generation, Optimization, and Platform 

Adaptation". 

[30] Message Passing Interface Forum. Int J. Supercomputer Applications, 8 (3/4), 1994. 

[3 1 ] http://en.wikipedia.org/wiki/SourceForge. 

[32] W. E. Pickett. Comput. Phys. Rep., 9:115, 1989. 

[33] Ismaila Dabo, Boris Kozinsky, Nicholas E. Singh-Miller, and Nicola Marzari. Phys. Rev. B, 77:115139, 
2008. 

[34] L. Kleinman and D. M. Bylander. Phys. Rev Lett., 48:1425, 1982. 

[35] D. R. Hamann, M. Schluter, and C. Chiang. Phys. Rev. Lett., 43:1494, 1979. 

[36] D. Vanderbilt. Phys. Rev. B, 41:7892, 1990. 

[37] R E. Blochl. Phys. Rev. B, 50(24): 17953, Dec. 1994. 

[38] J. R Perdew, J. A. Chevary, S. H. Vosko, K. A. Jackson, D. J. Singh, and C. Fiolhais. Phys. Rev. B, 
46:6671, 1992. 

[39] J. Tao, J. P Perdew, V. N. Staroverov, and G. E. Scuseria. Phys. Rev. Lett, 91:146401, 2003. 
[40] J. P Perdew, M. Ernzerhof, and K. Burke. / Chem. Phys., 105:9982, 1996. 
[41] A. D. Becke. J. Chem. Phys., 98:1372, 1993. 

[42] R J. Stephens, F. J. Devlin, C. R Chabalowski, and M. J. Frisch. /. Phys. Chem., 98:11623, 1994. 

[43] W. Kohn and L. J. Sham. Phys. Rev., 140:A1133, 1965. 

[44] H. Hellmann. Einfuhrung in die Quantenchemie . DeutUche, 1937. 

[45] R. R Feynman. Phys. Rev., 56:340, 1939. 

[46] O. H. Nielsen and R. M. Martin. Phys. Rev. B, 32:3780, 1985. 

[47] R Pyykko. Chem. Rev., 88:563, 1988. 



Quantum ESPRESSO 



33 



[48] T. Oda, A. Pasquarello, and R. Car. Phys. Rev. Lett, 80:3622, 1998. 

[49] R. Gebauer and S. Baroni. Phys. Rev. B, 61:R6459, 2000. 

[50] R. Car and M. Parrinello. Phys. Rev. Lett., 55:2471, 1985. 

[51] R. M. Wentzcovitch and J. L. Martins. Sol. St. Commun., 78:831, 1991. 

[52] M. Parrinello and A. Rahman. Phys. Rev. Lett., 45(14):1196-1199, Oct 1980. 

[53] R. M. Wentzcovitch. Phys. Rev. B, 44:2358-2361, 1991. 

[54] S. Baroni, R Giannozzi, and A. Testa. Phys. Rev. Lett., 58:1861, 1987. 

[55] S. Baroni, S. de Gironcoli, A. Dal Corso, and P. Giannozzi. Rev. Mod. Phys., 73:515, 2001. 

[56] X. Gonze. Phys. Rev A, 52:1096, 1995. 

[57] G. Henkelman and H. Jonsson. J. Chem. Phys., 111:7010, 1999. 

[58] W. E, W. Ren, and E. Vanden-Eijnden. Phys. Rev. B, 66:052301, 2002. 

[59] K. J. Caspersen and E. A. Carter P Natl. Acad. Sci. USA, 102(19):6738-43, 2005. 

[60] H. J. Choi and J. Ihm. Phys. Rev. B, 59:2267, 1999. 

[61] N. Marzari and D. Vanderbilt. Phys. Rev. B, 56:12847, 1997. 

[62] I. Souza, N. Marzari, and D. Vanderbilt. Phys. Rev. B, 65:035109, 2001. 

[63] C. J. Pickard and F. Mauri. Phys. Rev, B 63:245101, 2001. 

[64] C. J. Pickard and F. Mauri. Phys. Rev Lett., 91:196401, 2003. 

[65] M. Taillefumier, D. Cabaret, A. M. Hank, and R Mauri. Phys. Rev. B, 66:195107, 2002. 
[66] S. Scandolo, P. Giannozzi, C. Cavazzoni, S. de GironcoU, A. Pasquarello, and S. Baroni. Z Kristallogr., 
220:51 A, 2005. 

[67] J. Tersoff and D. R. Hamann. Phys. Rev. B, 31:805-813, 1985. 

[68] A. D. Becke and K. E. Edgecombe. J. Chem. Phys., 92:5397-5403, 1990. 

[69] A. Szabo and N. Ostlund. Modern Quantum Chemistry. Dover, 1996. 

[70] A. Baldereschi, S. Baroni, and R. Resta. Phys. Rev Lett., 61:734, 1988. 

[71] The HDF Group, http://www.hdfgroup.org. 

[72] http://www.unidata.ucar.edu/software/netcdf. 

[73] http ://w w w.quantum- simulation. org. 

[74] G. Bussi. http://www.s3.infm.it/iotk. 

[75] Martin Fuchs and Matthias Scheffler. Comp. Phys. Comm., 119(1):67, 1999. 
[76] http://www.physics.rutgers.edu/ dhv/uspp/. 

[77] OPIUM - pseudopotential generation project, http://opium.sourceforge.net. 

[78] A. Dal Corso. A pseudopotential plane waves program (pwscf) and some case studies, m Lecture 

Notes in Chemistry, Vol. 67, C. Pisani editor. Springer Verlag, Berlin (1996). 
[79] G. Kresse and D. Joubert. P/!>x /?ev. B, 59(3): 1758-1775, Jan 1999. 

[80] K. Laasonen, A. Pasquarello, R. Car, C. Lee, and D. Vanderbilt. Phys. Rev B, 47:10142, 1993. 

[81] R Giannozzi, F. de AngeUs, and R. Car. J. Chem. Phys., 120:5903, 2004. 

[82] J. A. White and D. M. Bird. Phys. Rev B, 50:4954, 1994. 

[83] A. Dal Corso and A. Mosca Conte. Phys. Rev B, 11:115106, 2005. 

[84] Adriano Mosca Conte. Quantum mechanical modeling ofnano magnetism: new tools based on Density - 
Functional theory with case applications to solids, surfaces, wires, and molecules. PhD thesis, 
SISSA/ISAS, Trieste, Italy, 2007. 

[85] V. 1. Anisimov, J. Zaanen, and O. K. Andersen. Phys. Rev B, 44:943, 1991. 

[86] M. Cococcioni and S. de GironcoU. Phys. Rev B, 71:035105, 2005. 

[87] H. Kwee, S. Zhang, and H. Krakauer Phys. Rev Lett., 100:126404, 2008. 

[88] D. D. Johnson. Phys. Rev. B, 38:12807, 1988. 

[89] D. J. Chadi and M. L. Cohen. Phys. Rev B, 8:5747, 1973. 

[90] H. J. Monkhorst and J.D. Pack. Phys. Rev B, 13:5188, 1976. 

[91] M. Methfessel and A. Paxton. Phys. Rev B, 40:3616, 1989. 

[92] N. Marzari, D. Vanderbilt, A. de Vita, and M. C. Payne. Phys. Rev. Lett., 82:3296, 1999. 
[93] R E. Blochl, O. Jepsen, and O.K. Andersen. Phys. Rev. B, 49:16223, 1994. 
[94] N. David Mermin. P/jy^. /?ev., 137(5A):A1441-A1443, Mar 1965. 



Quantum ESPRESSO 



34 



[95] R.Fletcher. Practical Methods of Optimization. John Wiley and Sons, 1987. 

[96] S. R. Billeter, A. J. Turner, and W. Thiel. Phys. Chem. Chem. Phys., 2:2177, 2000. 

[97] S. R. Billeter, A. Curioni, and W. Andreoni. Comp. Mater. Sci., 27:437, 2003. 

[98] C. Micheletti, A. Laio, and M. Parrinello. Phys. Rev. Lett., 92:170601, 2004. 

[99] L. Verlet. Phys. Rev., 159:98, 1967. 
[100] M. R Allen and D. J. Tildesley. Computer Simulations of Liquids. Clarendon Press, 1986. 
[101] M. Bemasconi, G. L. Chiarotti, P. Focher, S. Scandolo, E. Tosatti, and M. Parrinello. J. Phys. Chem. 

Solids, 56:501-505, 1995. 
[102] R Umari and A. Pasquarello. Phys. Rev Lett, 89:157602, 2002. 

[103] Ivo Souza, Jorge liiiguez, and David Vanderbilt. Phys. Rev Lett., 89(1 1):1 17602, Aug 2002. 
[104] K. Kunc andR. Resta. Phys. Rev. Lett., 51:686, 1983. 
[105] J. Tobik and A. Dal Corso. J. Chem. Phys., 120:9934, 2004. 

[106] Damian A. ScherUs, Jean-Luc Fattebert, Fran9ois Gygi, Matteo Cococcioni, and Nicola Marzari. J. Chem. 
Phys., 124:074103, 2006. 

[107] Ismaila Dabo, Eric Cances, Yanli Li, and Nicola Marzari. First-principles simulation of electrochemical 
systems at fixed applied voltage: Vibrational stark effect for co on platinum electrodes. 2009. 
http://arxiv.org/abs/090 1 .0096v2. 

[108] C. Cavazzoni and G. L. Chiarotti. Comput. Phys. Commun., 123:56, 1999. 

[109] A. Pasquarello, K. Laasonen, R. Car, C. Lee, and D. Vanderbilt. Phys. Rev Lett., 69:1982, 1992. 

[110] M. d'Avezac, M. Calandra, and E Mauri. Phys. Rev. B, 71:205210, 2005. 

[1 1 1] Hoi-Land Sit, Matteo Cococcioni, and Nicola Marzari. J. Electroanal. Chem., 607:107-112, 2007. 
[1 12] Hoi-Land Sit, Matteo Cococcioni, and Nicola Marzari. Phys. Rev. Lett., 97:028303, 2006. 
[113] N. Marzari, D. Vanderbilt, and M. C. Payne. Phys. Rev. Lett., 79:1337, 1997. 
[114] F. Tassone, F. Mauri, and R. Car. Phys. Rev. B, 50:10561, 1994. 
[115] D. J. Tobias, G. J. Martyna, and M. L. Klein. J. Phys. Chem., 97:12959, 1993. 
[116] Glenn J. Martyna, Michael L. Klein, and Mark Tuckerman. J. Chem. Phys., 97(4):2635-2643, 1992. 
[117] M. C. Payne, M. P Teter, D. C. Allen, T. A. Arias, and J. D. Joannopoulos. Rev. Mod. Phys., 64:1045, 
1992. 

[118] D. Marx and J. Hutter. Modem methods and algorithms of quantum chemistry. John von Neumann 

Institute for Computing, FZ Julich, pp. 301-449 (2000). 
[119] V. Dubois, P Umari, and A. Pasquarello. Chem. Phys. Lett., 390:193, 2004. 
[120] F. Giustino and A. Pasquarello. P/iys. /?ev. Lett., 95:187402, 2005. 
[121] P. Umari and A. Pasquarello. Diam. Relat. Mater., 14:1255, 2005. 
[122] R Umari and A. Pasquarello. Phys. Rev Lett., 95:137401, 2005. 
[123] L. Giacomazzi, R Umari, and A. Pasquarello. Phys. Rev Lett., 95:075505, 2005. 
[124] P Umari and A. Pasquarello. Phys. Rev. Lett., 98:176402, 2007. 
[125] R Giannozzi and S. Baroni. J. Chem. Phys., 100:8537, 1994. 
[126] X. Gonze. Phys. Rev B, 55:10337, 1997. 
[127] X. Gonze and C. Lee. Phys. Rev. B, 55:10355, 1997. 

[128] A. Dal Corso, A. Pasquarello, and A. Baldereschi. Phys. Rev. B, 56:R11369, 1997. 

[129] A. Dal Corso. Phys. Rev B,64-:235n8, 2001. 

[130] F. Favot and A. Dal Corso. Phys. Rev B, 60: 1 1427, 1999. 

[131] A. Dal Corso and S. de GironcoU. Phys. Rev. B, 62:273, 2000. 

[132] A. Dal Corso. submitted, 2009. 

[133] A. Dal Corso. Phys. Rev B, 76:054308,2007. 

[134] Feliciano Giustino, Marvin L. Cohen, and Steven G. Louie. Phys. Rev B, 76:165108, 2007. 
[135] M. Lazzeri and S. de Gironcoli. Phys. Rev. B, 65:245402, 2002. 
[136] M. Lazzeri and F Mauri. Phys. Rev Lett, 90:036401, 2003. 
[137] M. Lazzeri and E Mauri. Phys. Rev B,68:\6l\0\{R),2003. 

[138] D. D. KoelUng andB. N. Harmon. J. Phys. C: Solid State Phys., 10(16):3 107-3 114, 1977. 
[139] A. H. MacDonald and S. H. Vosko. J. Phys. C: Solid State Phys., 12(15):2977-2990, 1979. 



Quantum ESPRESSO 



35 



[140] A. K. Rajagopal and J. Callaway. Phys. Rev. B, 7(5):1912-1919, Mar 1973. 
1141] N. TroulUer and Jose Luis Martins. Phys. Rev. B, 43(3): 1993-2006, Jan 1991. 

[142] Andrew M. Rappe, Karin M. Rabe, Efthimios Kaxiras, and J. D. Joannopoulos. Phys. Rev. B, 41(2): 1227- 
1230, Jan 1990. 

[143] G. Kresse and J. Hafner J. Phys.-Condens. Mat, 6:8245, 1994. 

[144] A. Smogunov, A. Dal Corso, and E. Tosatti. Phys. Rev. B, 70:045417, 2004. 

[145] A. Dal Corso, A. Smogunov, and E. Tosatti. Phys. Rev. B, 74:045429, 2006. 

[146] M. Profeta, F. Mauri, and C. J. Pickard. /. Am. Chem. Soc, 125:541, 2003. 

[147] C. G. van de Walle and P E. Blochl. Phys. Rev., B 47:4244, 1993. 

[148] E Mauri and S. G. Louie. Phys. Rev Lett., l&.AlAb, 1996. 

[149] F. Mauri, B. Pfrommer, and S. G. Louie. Phys. Rev. Lett., 77:5300, 1996. 

[150] C. J. Pickard and E Mauri. Phys. Rev Lett., 88:086403, 2002. 

[151] Timo Thonhauser, Davide Ceresoli, Arash Mostofi, Nicola Marzari, Raffaele Resta, and David Vanderbilt. 

Electrostatics in periodic boundary conditions and real-space corrections, submitted to Phys. Rev. Lett., 

2009. http://arxivorg/abs/0709.4429vl . 
[152] C. Gougoussis, M. Calandra, A. Seitsonen, and E. Mauri. Phys. Rev. B, in press, 2009. 

http://arxiv.org/abs/0906.0897. 
[153] C. Gougoussis, M. Calandra, A. Seitsonen, Ch. Brouder, A. Shukla, and E. Mauri. Phys. Rev. B, 

79:045118, 2009. 
[154] http://www.wannier.org/. 

[155] P.L. Silvestrelli, N. Marzari, D. Vanderbilt, andM. Parrinello. Solid State Commun., 107:7, 1998. 
[156] A. Calzolari, I. Souza, N. Marzari, and M. Buongiomo NardeUi. Phys. Rev. B, 69:035108, 2004. 
[157] Young-Su Lee, M. Buongiorno Nardelli, and Nicola Marzari. Phys. Rev. Lett, 95:076804, 2005. 
[158] J. R. Yates, X. Wang, D. Vanderbilt, and I. Souza. Phys. Rev. B, 75:195121, 2007. 
[159] A. Kokalj. Xcrysden-a new program for displaying crystalline structures and electron densities. J.Mol. 

Graph. Model, 17:176-179, 1999. http://www.xcrysden.org/. 
[160] WilUam Humphrey, Andrew Dalke, and Klaus Schulten. VMD - Visual Molecular Dynamics. J. Mol. 

Graph. Model, 14:33-38, 1996. 
[161] http://www.tcm.phy.cam.ac.uk/ mdt26/casino2.html. 

[162] D. Korotin, A. V. Kozhevnikov, S. L. Skornyakov, I. Leonov, N. Binggeli, V. I. Anisimov, and 
G. Trimarchi. Ewr. P/iys. 7. fi, 65:91-98, 2008. http://dmft@qe-forge.org. 

[163] L. Martin-Samos and G. Bussi. Sax: An open source package for electronic- structure and optical- 
properties calculations in the gw approximation. Comp. Phys. Comm., in press, 2009. http://sax- 
project.org. 

[164] http://dp-code.org. 

[165] http ://ww w.bethe- salpeter.org. 

[166] A. Kokalj. Computer graphics and graphical user interfaces as tools in simulations of matter at the atomic 

scale. Comp. Mater Set, 28:155, 2003. http://www-k3.ijs.si/kokalj/guib/. 
[167] P. Giannozzi and C. Cavazzoni. Large-scale computing with quantum-espresso. Nuovo Cimento C, in 

press, 32, 2009. 

[168] Juerg Hutter and Alessandro Curioni. Car-parrineUo molecular dynamics on massively parallel 

computers. ChemPhysChem, 6:1788, 2005. 
[169] L. S. Blackford, J. Choi, A. Cleary, J. Demmel, L Dhillon, J. Dongarra, S. HanmiarUng, G. Henry, 

A. Petitet, K. Stanley, D. Walker, and R. C. Whaley. Scalapack: A portable linear algebra library 

for distributed memory computers - design issues and performance. SC Conference, 0:5, 1996. 
[170] Francois Gygi, Erik W. Draeger, Martin Schulz, Bronis R. de Supinski, John A. Gimnels, Vernon Austel, 

James C. Sexton, Franz Franchetti, Stefan Krai, Christoph W. Ueberhuber, and Juergen Lorenz. IBM 

J. Res. Dev., 52(1/2): 137, 2008. 
[171] Using OpenMP. Portable Shared Memory Parallel Programming. The MIT Press, 2007. 
[172] Erich Runge and E. K. U. Gross. Phys. Rev. Lett, 52(12):997-1000, 1984. 

[173] M. A. L. Marques, C. L. Ullrich, F. Nogueira, A. Rubio, K. Burke, and E. K. U. Gross, editors. Time- 



Quantum ESPRESSO 



36 



Dependent Density Functional Theory, volume 706 of Lecture notes in Physics. Springer- Verlag, 

Berlin, Heidelberg, 2006. DOI-10.1007/3-540-35426-3-17. 
[174] Giovanni Onida, Lucia Reining, and Angel Rubio. Rev. Mod. Phys., 74(2):601-659, 2002. 
[175] D. Rocca, R. Gebauer, Y. Saad, and S. Baroni. J. Chem. Phys., 128:154105, 2008. 
[176] P. Umari, Geoffrey Stenuit, and Stefano Baroni. Phys. Rev. B, 79:201 104(R), 2009. 
[177] H.-V. Nguyen and S. de Gironcoli. Phys. Rev B, 79:205114, 2009. 
[178] H.-V. Nguyen and S. de Gironcoli. Phys. Rev B, 79:115105, 2009. 

[179] H.-V. Nguyen. PhD thesis, SISSA, 2008. http://www.sissa.it/cm/thesis/2008/VietHuyNguyen_PhDthesis.pdf. 
[180] Andrea Marini, Conor Hogan, Myrta Griining, and Daniele Varsano. Yambo: an ab initio tool for excited 

state calculations. Comp. Phys. Commun., in press, 2009. http://www.yambo-code.org. 
[181] A. Calzolari, I. Souza, N. Marzari, and M. Buongiomo Nardelli. Phys. Rev B, 69:035108, 2004. 

http://www.wannier-transport.org. 
[182] http://qha@qe-forge.org. 

[183] A. Kokalj, pwtk: a Tel scripting interface to PWscf, hllp://pwtk.qe-forge.org/. 

1 1841 P da Silveira, C. R. S. da Silva, and R. M. Wentzcovitch. Comput. Phys. Commun., 178:186, 2008. 

[185] Cesar R. S. da Silva, P R. C. da Silveira, B. B. Karki, R. M Wentzcovitch, P A. Jensen, E. F. Bollig, 

M. Pierce, G. Erlebacher, and D. A. Yuen. Phys. Earth Planet. Int., 163:321, 2007. Special Issue: 

Computational Challenges. 
[186] http://dasilveira.cems. umn.edu:8080/gridsphere/gridsphere, http://www.vlab.msi.umn.edu. 
1 187] R. M. Wentzcovitch, B. B. Karki, M. Cococcioni, and S. de GironcoU. Phys. Rev Lett., 92:018501, 2004. 
[188] D. Raczkowski, A. Canning, and L. W. Wang. Phys. Rev B, 64:R121101, 2001. 
[189] T. A. Arias, M. C. Payne, and J. D. Joannopoulos. Phys. Rev B, 45:1538, 1992. 
[190] A. Qteish. Phys. Rev B, 52:14497-14504, 1995. 
[191] C. Alden Mead. Rev Mod Phys., 64:51, 1992. 

[192] R. di Meo, A. Dal Corso, P. Giannozzi, and S. Cozzini. Calculation of phonon dispersions on the grid 

using quantum espresso. In Proceedings of the COST School, Trieste, 2009. to be published. 
[193] F. Gygi and A. Baldereschi. Phys. Rev B, 34:4405, 1986. 



